NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Is Programming by Examples Solved by LLMs?

Li, Wen-Ding; Ellis, Kevin (December 2025, NeurIPS)

Full Text Available
Doing Experiments and Revising Rules With Natural Language and Probabilistic Reasoning

Piriyakulkij, Top Wasu; Langenfeld, Cassidy; Le, Tuan-Anh; Ellis, Kevin (December 2025, NeurIPS)

Full Text Available
PoE-World: Compositional World Modeling with Products of Programmatic Experts

Piriyakulkij, Wasu Top; Liang, Yichao; Tang, Hao; Weller, Adrian; Kryven, Marta; Ellis, Kevin (December 2025, NeurIPS)

Full Text Available
Synthesizing theories of human language with Bayesian program induction

https://doi.org/10.1038/s41467-022-32012-w

Ellis, Kevin; Albright, Adam; Solar-Lezama, Armando; Tenenbaum, Joshua B.; O’Donnell, Timothy J. (December 2022, Nature Communications)

Abstract Automated, data-driven construction and evaluation of scientific models and theories is a long-standing challenge in artificial intelligence. We present a framework for algorithmically synthesizing models of a basic part of human language: morpho-phonology, the system that builds word forms from sounds. We integrate Bayesian inference with program synthesis and representations inspired by linguistic theory and cognitive models of learning and discovery. Across 70 datasets from 58 diverse languages, our system synthesizes human-interpretable models for core aspects of each language’s morpho-phonology, sometimes approaching models posited by human linguists. Joint inference across all 70 data sets automatically synthesizes a meta-model encoding interpretable cross-language typological tendencies. Finally, the same algorithm captures few-shot learning dynamics, acquiring new morphophonological rules from just one or a few examples. These results suggest routes to more powerful machine-enabled discovery of interpretable models in linguistics and other scientific domains.
more » « less
Full Text Available
Top-Down Synthesis for Library Learning

https://doi.org/10.1145/3571234

Bowers, Matthew; Olausson, Theo X.; Wong, Lionel; Grand, Gabriel; Tenenbaum, Joshua B.; Ellis, Kevin; Solar-Lezama, Armando (January 2023, Proceedings of the ACM on Programming Languages)

This paper introduces corpus-guided top-down synthesis as a mechanism for synthesizing library functions that capture common functionality from a corpus of programs in a domain specific language (DSL). The algorithm builds abstractions directly from initial DSL primitives, using syntactic pattern matching of intermediate abstractions to intelligently prune the search space and guide the algorithm towards abstractions that maximally capture shared structures in the corpus. We present an implementation of the approach in a tool called Stitch and evaluate it against the state-of-the-art deductive library learning algorithm from DreamCoder. Our evaluation shows that Stitch is 3-4 orders of magnitude faster and uses 2 orders of magnitude less memory while maintaining comparable or better library quality (as measured by compressivity). We also demonstrate Stitch’s scalability on corpora containing hundreds of complex programs that are intractable with prior deductive approaches and show empirically that it is robust to terminating the search procedure early—further allowing it to scale to challenging datasets by means of early stopping.
more » « less
Full Text Available
Program Synthesis with Pragmatic Communication

Pu, Yewen; Ellis, Kevin; Kryven, Marta; Tenenbaum, Josh; Solar-Lezama, Armando (December 2020, Advances in neural information processing systems)
null (Ed.)
Full Text Available
Program Synthesis with Pragmatic Communication

Pu, Yewen; Ellis, Kevin; Kryven, Marta; Tenenbaum, Josh; Solar-Lezama, Armando (December 2020, Advances in neural information processing systems)

Full Text Available
Neurosymbolic Programming

https://doi.org/10.1561/2500000049

Chaudhuri, Swarat; Ellis, Kevin; Polozov, Oleksandr; Singh, Rishabh; Solar-Lezama, Armando; Yue, Yisong (January 2021, Foundations and Trends® in Programming Languages)

Full Text Available
Neurosymbolic Programming

https://doi.org/10.1561/9781680839357

Chaudhuri, Swarat; Ellis, Kevin; Polozov, Oleksandr; Singh, Rishabh; Solar-Lezama, Armando; Yue, Yisong (January 2021, Foundations and trends in programming languages)

Full Text Available
DreamCoder: bootstrapping inductive program synthesis with wake-sleep library learning

https://doi.org/10.1145/3453483.3454080

Ellis, Kevin; Wong, Catherine; Nye, Maxwell; Sablé-Meyer, Mathias; Morales, Lucas; Hewitt, Luke; Cary, Luc; Solar-Lezama, Armando; Tenenbaum, Joshua B. (June 2021, International Conference on Programming Language Design and Implementation)

We present a system for inductive program synthesis called DreamCoder, which inputs a corpus of synthesis problems each specified by one or a few examples, and automatically derives a library of program components and a neural search policy that can be used to efficiently solve other similar synthesis problems. The library and search policy bootstrap each other iteratively through a variant of "wake-sleep" approximate Bayesian learning. A new refactoring algorithm based on E-graph matching identifies common sub-components across synthesized programs, building a progressively deepening library of abstractions capturing the structure of the input domain. We evaluate on eight domains including classic program synthesis areas and AI tasks such as planning, inverse graphics, and equation discovery. We show that jointly learning the library and neural search policy leads to solving more problems, and solving them more quickly.
more » « less
Full Text Available

« Prev Next »

Search for: All records